An ERB loudness pattern based objective speech quality measure

نویسندگان

  • Guo Chen
  • Vijay Parsa
  • Susan Scollie
چکیده

This paper presents an objective speech quality measure which is based on loudness patterns using the equivalent rectangular bandwidth (ERB) scale. The proposed measure, called the loudness pattern distortion (LPD), is computed from the differences between the loudness patterns of the original and processed speech. The LPD measure takes into account the transmission through the outer and middle ear, the calculation of an excitation pattern from the physical spectrum, and the transformation of an excitation pattern to a loudness pattern. The effectiveness of the proposed measure was demonstrated by experimental evaluations in comparison with the standard ITU-T P.862 (PESQ) using three coded speech database of the ITU-T P-series Supplementary 23.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of two objective speech quality measures: MBSD and ITU-T Recommendation P.861

The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1, 2]. The MBSD measure estimates speech distortion in loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD ove...

متن کامل

Perceptual aspects of voice-source parameters

Both in speech synthesis and in sound coding it is often beneficial to have a measure that predicts whether, and to what extent, two sounds are different. This chapter addresses the problem of estimating the perceptual effects of small modifications to the spectral envelope of a harmonic sound. A recently proposed auditory model is investigated that transforms the physical spectrum into a patte...

متن کامل

Improvement of MBSD by scaling noise masking threshold and correlation analysis with MOS difference instead of MOS

The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1][2]. The MBSD measure estimates speech distortion in the loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD...

متن کامل

A measure for predicting audibility discrimination thresholds for spectral envelope distortions in vowel sounds.

Both in speech synthesis and in sound coding it is often beneficial to have a measure that predicts whether, and to what extent, two sounds are different. This paper addresses the problem of estimating the perceptual effects of small modifications to the spectral envelope of a harmonic sound. A recently proposed auditory model is investigated that transforms the physical spectrum into a pattern...

متن کامل

Singing in groups for Parkinson's disease (SING-PD): a pilot study of group singing therapy for PD-related voice/speech disorders.

Parkinson's disease related speech and voice impairment have significant impact on quality of life measures. LSVT(®)LOUD voice and speech therapy (Lee Silverman Voice Therapy) has demonstrated scientific efficacy and clinical effectiveness, but musically based voice and speech therapy has been underexplored as a potentially useful method of rehabilitation. We undertook a pilot, open-label study...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006